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ABSTRACT 

This  paper  considers  generalized  equations,  which  are  convenient  tools 
for  formulating  problems  in  complementarity  and  in  mathematical  programming, 
as  well  as  variational  inequalities.  We  introduce  a regularity  condition 
for  such  problems  and,  with  its  help,  prove  existence,  uniqueness  and 
Lipschitz  continuity  of  solutions  to  generalized  equations  with  parametric 
data.  Applications  to  nonlinear  programming  and  to  other  areas  are  discussed, 
and  for  important  classes  of  such  applications  the  regularity  condition  given 
here  is  shown  to  be  in  a certain  sense  the  weakest  possible  condition  under 
which  the  stated  properties  will  hold. 
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SIGNIFICANCE  AND  EXPLANATION 

Many  practical  problems  in  operations  research,  in  structural  engineer- 
ing, and  in  other  fields  may  be  reduced  to  problems  in  mathematical 
programming  (minimizing  a function  subject  to  constraints)  or  in  complemen- 
tarity (a  type  of  equilibrium  formulation) . In  this  paper  we  discuss  a 
type  of  formula  called  a generalized  equation,  which  provides  a convenient 
mathematical  tool  for  expressing  problems  such  as  the  ones  just  mentioned . 

We  exhibit  a condition  under  which  the  solution  of  such  a generalized  equation 
will  be  locally  unique  and  will  have  a certain  type  of  stability  when  the 
data  of  the  problem  are  allowed  to  change  slightly,  and  we  argue  that  in  a 
sense  this  condition  is  the  best  possible  if  such  results  are  to  be  obtained. 


The  responsibility  for  the  wording  and  views  expressed  in  this  descriptive 
summary  lies  with  MRC,  and  not  with  the  author  of  this  report. 
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STRONGLY  REGULAR  GENERALIZED  EQUATIONS 
Stephen  M.  Robinson 

1.  Introduction.  This  paper  deals  with  general! zed  equations ; that  is,  inclusions 
of  the  form 


0 * f(x)  + 3ili  (x)  , 


(1.1) 


3il>c  (x ) : = 


where  f is  a function  from  a subset  D of  a normed  linear  space  X to  its  (topological) 
dual  X'  , C is  a nonempty  closed  convex  set  in  X,  and  the  notation  3i|;c  is  the 
normal  cone  operator: 

{y  e X'|y(c  - x)  < 0 for  all  c e c}  if  x e c, 

0 , if  x 4 C . 

Such  generalized  equations  conveniently  formulate  many  problems  of  interest  in  applica- 
tions, both  in  the  infinite-dimensional  and  in  the  finite-dimensional  cases.  Foi  example, 
n k £ 

when  X = F and  C is  of  the  form  F x F+,  with  k + £ = n,  then  (1.1)  can  be 
used  to  formulate  problems  in  complementarity  and  in  nonlinear  programming.  Examples 
are  given  in  [12]  as  well  as  in  Section  4 of  this  paper. 

In  [12],  we  studied  the  behavior  of  solutions  of  (1.1)  when  the  function  f was 
subjected  to  small  perturbations.  The  conditions  imposed  upon  the  generalized  equation 
in  [12]  were  such  as  to  permit  sets  of  solutions,  rather  than  isolated  or  unique  solutions. 
Of  course,  for  purposes  of  analysis  it  is  sometimes  convenient  to  deal  with  solutions 
which  are  unique,  or  at  least  unique  in  some  neighborhood.  Therefore  we  investigate 
here  the  problem  of  finding  appropriate  conditions  for  solutions  of  (1.1)  and  problems 
"close"  to  it  to  have  locally  unique  solutions,  and  for  those  solutions  to  have  good 
continuity  propel  ties  when  regarded  as  functions  of  the  perturbations  introduced  into 
(1.1).  Assuming  that  f is  Frechet  differentiable  in  some  neighborhood  of  a solution 
xQ  of  (1.1),  we  obtain  a condition  on  the  linearization  of  (1.1)  about  xQ:  i .e . , on 
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the  linear  generalized  equation 


0 e f (xQ)  + f ' (x  ) (x  - xQ)  + 8i|)c(xl  , (1.2) 

which  ensures  good  behavior  of  the  solutions  of  the  nonlinear  problem  (1.1).  This  condi- 
tion, which  we  call  strong  regularity , is  analogous  to  the  nonsingularity  condition 
imposed  in  the  usual  implicit-function  theorem  for  nonlinear  equations,  and  indeed  it 
reduces  to  that  condition  if  C is  the  whole  space  (so  that  (1.1)  reduces  to  f (x)  = 0) . 

We  show  also  that,  in  certain  sense,  these  conditions  are  the  best  possible  if  only  the 
information  contained  in  (1.2)  is  used. 

The  reader  will  see  in  what  follows  that,  throughout  the  paper,  the  linearization 
(1.2)  plays  a fundamental  role,  not  only  in  the  conditions  just  mentioned,  but  also  in 
the  sensitivity  analysis  of  (1.1)  with  respect  to  some  family  of  perturbations.  For 
example,  we  show  in  Theorem  2.3  below  that  (1.2)  contains  the  appropriate  local  informa- 
tion to  carry  out  a sensitivity  analysis  of  (1.1)  for  small  perturbations,  without 
solving  the  nonlinear  problem  (1.1)  over  again  for  each  perturbation  considered.  Thus, 
(1.2)  summarizes  important  first-order  information  about  the  behavior  of  (1.1)  near  xQ, 
in  a form  which  is  fairly  convenient  for  analysis  and  computation. 

The  arrangement  of  topics  in  the  rest  of  the  paper  is  as  follows:  in  Section  2 

we  define  strong  regularity  and  prove  the  main  results  about  the  nonlinear  problem  (1.1); 

in  Section  3 we  study  the  linearization  (1.2)  and  exploit  its  structure  in  specific 
cases  to  develop  criteria  for  strong  regularity  to  hold.  We  also  discuss  briefly  a 
reduction  procedure  which  is  very  useful  when  C is  polyhedral  (as  is  the  case  in 

important  areas  of  applications);  the  details  of  the  reduction  for  general  polyhedral  C 

are  deferred  to  the  appendix.  Finally,  in  Section  4 we  apply  the  results  previously 
developed  to  t'.  important  special  cases  of  complementarity  (briefly)  and  of  nonlinear 
programming  (in  more  detail).  In  particular,  we  show  that  satisfactory  analysis  of  a 
nonlinear  programming  problem  can  be  carried  out  without  the  usual  assumption  of  strict 
complementary  slackness. 
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We  close  this  introduction  by  indicating  briefly  some  similarities  and  differences 
between  this  paper  and  others  which  have  been  written  previously.  The  standard  implicit- 
function  theorem  was  used  in  [3),  [4],  [11]  to  analyze  perturbed  nonlinear  prograntninq 
problems  under  stronger  hypotheses  than  those  we  use  here.  Extensions  of  that  approach 
to  generalized  equations  will  be  reported  in  the  forthcoming  dissertation  of  A.  Reinoza 
in  the  Computer  Sciences  Department  of  the  University  of  Wisconsin-Madison.  Levitin  [7] , 
[8] , [9]  has  investigated  stability  aspects  of  infinite-dimensional  optimization 
problems,  and  Mangasarian  1 10]  has  obtained  uniqueness  results  in  linear  programming 
using  hypotheses  which  seem  to  be  closely  related  to  those  used  here. 

Finally,  we  mention  two  papers  treating  the  stability  of  convex  quadratic  programming 
problems:  Daniel  [2]  used  assumptions  similar  to  those  made  in  [12],  while  Hager  [5] 
imposed  conditions  similar  to  those  used  in  this  paper.  However,  Hager's  method  of 
analysis  is  quite  different  from  ours:  for  example,  he  assumes  that  unique  solutions 
will  exist  for  all  of  the  perturbed  problems  that  he  considers,  and  he  then  proceeds 
i to  investigate  the  continuity  properties  of  these  solutions.  We  prefer  to  identify 
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conditions  on  the  unperturbed  problem  which  will  permit  us  to  prove  that  solutions  of 
the  perturbed  problems  will  in  fact  exist,  and  which  in  addition  enable  us  to  analyze 
their  continuity  properties. 

Many  other  papers  have,  of  course,  been  written  on  stability  questions  related  to 
those  we  investigate.  The  above  papers  represent  only  a sample  of  the  literature;  many 
additional  references  can  be  found  in  them. 


2.  Strong  regularity  and  local  solvability.  In  this  section  we  define  a condition, 
called  strong  regularity,  which  can  be  satisfied  by  a generalized  equation  at  a solution 
point.  We  prove  a basic  solvability  theorem  which  says,  roughly  speaking,  that  if  a 
generalized  equation  is  strongly  regular  at  a solution  point  then  it  is  invertible  near 
that  point  and  the  inverse  function  is  Lipschitzian;  further,  any  generalized  equation 
which  is  close,  in  a suitable  sense,  to  the  one  with  which  we  are  working  will  share 
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these  desirable  properties.  We  then  show  how  this  basic  solvability  theorem  may  ho 
applied,  first  to  the  problem  of  parametric  sensitivity  analysis  and  then  to  the  exten- 
sion of  Banach's  inversion  lemma  to  linear  generalized  equations.  Both  of  these 
applications  depend  upon  strong  regularity,  which  we  now  proceed  to  define;  conditions 
to  ensure  that  strong  regularity  holds  at  a point  will  be  developed  in  Section  3.  The 
definition  and  results  given  here  are  stated  for  spaces  more  general  than  F^,  but  the 
material  in  the  remaining  sections  is  developed  for  Fn. 

DEFINITION:  Let  X be  a normed  linear  space,  and  let  ft  be  an  open  subset  of  X 
containing  a point  x^.  Let  C be  a closed  convex  set  in  X,  and  let  f : ft  -*•  X* 

(the  topological  dual  of  X)  be  Frechet  differentiable  at  xQ.  Suppose  that  the 
generalized  equation 

0 e f(x)  + 3iJ>c(x)  (2.1) 

has  xQ  as  a solution,  and  define,  for  x e X, 

Tx  :=  f(xQ)  + f*  (xQ) (x  - xQ)  + 3^c (x)  . 

We  say  that  (2.1)  is  strongly  regular  at  x^  with  associated  Lipschitz  constant  X if 
there  exist  neighborhoods  U of  the  origin  in  X'  and  V of  x^  such  that  the 
restriction  to  U of  T ^ v is  a single-valued  function  from  (J  to  V which  is 

Lipschitzian  on  U with  modulus  A. 

Note  that  if  C is  the  whole  space  X (so  that  (x)  = 0)  then  (2.1)  becomes 

f (x)  = 0,  and  strong  regularity  then  amounts  to  the  assumption  that  f'(x^)  * is  a 
continuous  linear  operator,  which  is  the  regularity  condition  that  one  would  normally 
impose  in  such  a case. 

We  can  now  prove  the  main  result,  a type  of  implicit-function  theorem  for  Generalized 
equations  satisfying  the  strong  regularity  condition. 

THEOREM  2.1:  Let  X,  X*,  C,  ft  and  x()  be  as  in  the  Definition.  Let  P be  a 
topological  space,  pQ  6 P,  and  let  f : P x ft  ->  X ' . Suppose  that  the  partial  Frechet 
derivative  of  f with  respect  to  the  second  variable,  written  f' (*,*),  exists  on 
P * ft,  that  both  f ( - , - ) and  f'(*,*)  are  continuous  at  (p^,x^) , and  that  solves 

0 e f (pQ,x)  + (x)  . (2.2) 
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If  (2.2)  is  strongly  regular  at  x^,  with  associated  Lipschitz  constant  A , then 
for  any  e > 0 there  exist  neighborhoods  Nf  of  and  W of_  xQ,  and  a single- 

valued  function  x : •*  , such  that  for  any  p f N^_,  x(p)  is  the  unique  solution 

in  Wc  of  the  inclusion 

0 e f (p,x)  + 3i(/c(x)  . (2.3) 

Further,  for  each  p and  q iri  N£  one  has 

||x(p)  - x(q)||  < (X  + K)  ||  f (p,x(q)  ) - f(q,x(q))||  . (2.4) 

PROOF:  Suppose  that  a positive  £ has  been  prescribed;  choose  a positive  number 
6 so  small  that  16  < e/(A  + e).  By  strong  regularity,  there  exist  neighborhoods  0 
of  the  origin  and  V of  xQ  such  that  if  we  define,  for  x e X, 

L(x)  :=  f(P0rXQ)  + f'(P0,x0)(x  - xQ)  + 3*c(x)  , 

then  the  restriction  to  U of  l ^ n V is  single-valued  and  Lipschitzian  with  modulus  X. 
Let 

r(p,x)  :=  f(PQ.x0>  + f'(P0,x0)(x  - xQ)  - f (p,x) 

for  (p,x)  e P x ft,  and  choose  a neighborhood  W of  p^  and  a closed  ball  V r of 

radius  p about  x so  that  V C V and  for  each  pew  and  x e v one  has 

o e e 

r(p,x)  c u and  ||f'(p,x)  - f'(PQ*XQ)||  - ^ • N°w  shrink  W,  if  necessary,  to  obtain 
a neighborhood  of  p^  with  N£  c W and  having  the  property  that  for  each  p € N , 

X II  f (po'  x0)  " f <P^xo)  II  = (1  " A<5)0  • 

Choose  any  p € N , and  define  an  operator  4>  from  V to  V bv 
e pc’ 

* (x)  :=  V n L_1(r(p,x)l  . 

Note  that  x e V£  n 4p(x)  if  and  only  if  x £ and  0 e f(p,x)  + 3^c(x). 

Now  let  x^,  x^  be  any  two  points  of  . We  have,  using  the  assumption  of  strong 
regularity, 

ll*p<xl>  -‘*’p(x2)ll  < A||r(p,x1)  -r(p,x2)|| 

i Allx1_  X2 II  SUP(  II  t 1 (p,  ( 1 - p)  x^  + UX2)  |j  Jh  < u ‘ 1} 

< X6||x1  - x 2 1 1 , 
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r ' ~ i 


since  r'(p,x)  = f 1 (Pq»xq)  - f'(p,x).  Since  A6  < 1,  is  strongly  contractiv0  on  . 

Further,  since  x^  * V H L ^(0)  and  ♦ (x^)  = V n l ^(r(p,Xg)],  we  have 

llyv  - x0ll  i A l(  r (p,  x0>|| 

= Mlf  (Pq'V  " f(p,x0)  II 

< (1  - A 6 ) p , 

and  therefore  for  any  x e V , 

e 

ll*p<*>  - *0II  < ||4>P(X)  - $p(x0)  ||  + ||»p(x0)  - XQ|| 

< A6  ||x  - xQ  ||  + (1  - A<5)p  < p , 

so  that  $ is  a self-map  of  V . By  the  contraction  principle,  <t>  then  has  a unique 
p e p 

fixed  point  x(p)  e V , and  for  each  x e one  has  the  bound 

1 1 x (p)  - x ||  £ (1  - A6)  1 1|  4>^ (x ) - x ||  . (2.5) 

It  follows  from  our  earlier  observation  that  x(p)  is  the  unique  solution  of  (2.3)  in 

V . We  have  thus  established  the  existence  of  the  function  x : N -*■  V ; to  obtain 

e c e 

the  bound  (2.4)  we  take  any  p and  q in  and  apply  (2.5)  with  x = x(q)  to  get 

II X (p)  - x (q)  ||  < (1  - A6)  1 1|  (x  (q)  ) - x(q)||  . 

If  we  now  recall  that  x(q)  = <J>^(x(q))  and  employ  the  bound 

||  *t>p  (x  (q)  ) - <*>q(x(q)  ) |!  < A ||r(p,x(q)  ) - r(q,x(q))|| 

= A ||  f (p,x(q) ) - f (q,  x (q)  ) ||  , 

we  have 

1 1 x ( p)  - x (q)  ||  < A(1  - A6)  1 ||  f (p , x (q)  ) - f(q,x(q))||  . 

Observing  that  A(1  - A6)  ^ < A + e,  we  find  that  (2.4)  holds,  so  that  the  proof  is 
complete . 

By  imposing  a more  restrictive  continuity  conditio  i we  can  show  that  x(*)  is 
locally  Lipschitzian  at  pQ. 

COROLLARY  2.2:  Assume  the  notation  and  hypotheses  of  Theorem  2.1.  Suppose  further 
that  P is  a subset  of  a normed  linear  space  and  that  for  some  constant  v and  for 
each  p,q  e and  each  x e V , one  has 
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||f(p,x)  - t (q,x)  jj  < v||p-  q||  . 


Then  x(-)  is  Lipschitzian  on  with  modulus  v(X  + c)  . 


PROOF:  Immediate  from  (2.4). 


It  may  be  worth  pointing  out  here  that  the  condition  of  strong  regularity  is  the 


weakest  possible  condition  which  can  be  imposed  on  the  value  of  a function  f and  its 


derivative  at  a point  x^,  so  that  for  each  perturbation  structure  satisfying  the 


hypotheses  of  Theorem  2.1  and  Corollary  2.2,  a function  x(-)  will  exist  having  the 


properties  stated  in  those  results.  To  see  this,  one  has  only  to  consider  a function 


f : fi  * X'  which  is  Frechet  differentiable  at  x^  and  which  satisfies 


0 e f (xQ)  + ai^tXg).  Let  P be  a neighborhood  of  the  origin  in  X',  and  let 


f(p,x)  :=  f(xQ)  + f'(Xg)(x  - Xq)  - p,  with  pQ  = 0.  Fix  some  positive  e.  If  neighbor- 


hoods and  V£,  and  a function  x(-),  exist  having  the  properties  asserted  in 


Theorem  2.1  and  Corollary  2.2,  then  with 


Tx  :=  f(xQ)  + f' (x  ) (x  - xQ)  + (x)  , 


we  see  that  the  restriction  to  of  T Ft  is  a single-valued,  Lipschitzian  func- 


tion: that  is,  the  generalized  equation  Of  f(x)  + 3ip^(x)  is  strongly  regular  at  x^. 


One  of  the  by-products  of  Theorem  2.1  is  a useful  formula  for  parametric  sensitivity 


analysis,  which  we  give  in  the  next  theorem. 


THEOREM  2.3:  Assume  the  notation  and  hypotheses  of  Theorem  2.1  and  Corollary  2.2. 


Then  for  each  c > 0 there  exists  a function  a : N -*■  R,  with  lim  a(p)  = 0, 

f^P0 

such  that  for  any  p e one  has 


I x (p)  - *p(x0)  ||  < a (p)  ||p-  pQ| 


Before  proving  this  result  it  may  be  helpful  to  interpret  it.  To  compute  $ lx  ) 


is  to  find  the  unique  solution  in  of  the  linear  generalized  equation 


Off  (p.xQ)  + f 1 (Pq.Xq)  (x  - xQ)  + 3't'c  (x)  , 


and  the  theorem  says  that  this  solution,  for  values  of  p near  p^,  will  be  very  close 


to  the  solution  x(p)  of  the  nonlinear  generalized  equation 


Off  ( p , x ) + 3^c  (x) 


\ 


V 


I 


It  is  easy  to  verify  that  in  the  case  C = X (i.e.,  when  on«  is  solving  f(p,x)  (j) 
this  corresponds  to  the  result  that  if  f(-,x  ) is  Frechet  d if ferentiable  at  pf  , then 
so  is  x ( • ) , with 


X,(P0>  = -f,(po'xo) 


-1  d 


3p  f(W 


In  many  applications,  one  might  find  (2.6)  significantly  easier  to  solve  than  (2.7) 
For  example,  in  finite-dimensional  applications  involving  nonlinear  problems  in 
complementarity  or  in  mathematical  programming,  (2.6)  is,  respectively,  a linear 
complementarity  problem  or  a quadratic  programming  problem.  Particularly  if  one  is 
interested  in  sensitivity  analysis  of  a computed  solution  to  a nonlinear  problem,  one- 
may  already  have  at  hand  much  of  the  information  needed  to  solve  (2.6)  quickly  for 
different  values  of  p (e.g.,  using  parametric  linear  complementarity  or  parametric 
quadratic  programming  techniques).  Thus,  in  such  cases  Theorem  2.3  could  provide  a 
relatively  cheap  way  to  find  a good  approximation  to  x (p)  for  p near  p^ . 

PROOF  OF  THEOREM  2.3:  We  know  that  x (p)  = <t>  (x(p));  thus  by  strong  regularity 

P 

we  have 

II  x (p)  - <*»p  (xQ)  ||  < A||r(p,x(p))  - r(p,xQ)|| 


X ||  x (p)  - xQ||  supi  ||r  ' (p»  (1  - p)x(p)  + ux  ) ||  0 < u < 1} 


< A v ( A + e)  ||  p - p0||sup{  ||  f ' (Pq'Xq)  - f ' (p,  (l-u)x(p)  + pxQ)||  0 < u < 1} . (2 


The  quantity  in  brackets  approaches  zero  as  p approaches  p(|,  by  the  continuity  of 
x(*)  and  of  f'  (*,*),  and  this  completes  the  proof. 

Note  that  if  we  knew  that  for  some  constant  8 and  all  p c P,  x e , w<_*  had 

||f(p,x)  - f'(p0.x0)||  < B ( II  p - P0 II  + ||  X - xQ  II ) , 
then  we  could  obtain  from  (2.8)  a bound  of  the  form 

II  X (p)  - ‘fptXg)  II  < Y II  P - P0  II  2 - 

for  some  constant  y. 
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We  illustrate  next  another  application  of  Theorem  2.1,  this  time  to  the  establish- 
ment of  an  analogue  of  the  Banach  perturbation  lemma  for  linear  operators. 

THEOREM  2.4:  Let  X be  a Banach  space,  let  a^  be  a point  of  the  dual  space  X', 
let  C be  a closed  convex  set  in  X and  let  AQ  belong  to  L(X,X’)  (the  space  of 

bounded  linear  operators  from  X to  X1).  Suppose  that  xQ  is  a point  of  X whuh 

satisfies  the  generalized  equation 

0 e AQx  + aQ  + 3ii>c  (x)  . (2.9) 

If  (2.9)  is  strongly  regular  at  xQ  with  associated  Lipschitz  modulus  > , then 
there  exist  neighborhoods  M o£  AQ  in  L (X,X ' ) , N of^  aQ  and  W of  the  origin 

in  X',  and  V o£  xQ,  such  that  if  for  A e M,  a e N and  x e V one  defines 

T(A,a,x)  :=  Ax  + a + 3iJ<c(x)  , 

then  T (A , a , • ) n V is  a single-valued  function  on  W and  is  Lipschitzian  there  with 

modulus  1(1  - A ||  A — Aq||  ) 1. 

PROOF:  Apply  Theorem  2.1  with  P :=  L(X,X')  x X',  pQ  :=  (A^,a^) , and 
f (p,x)  :=  Ax  + a (with  any  positive  e)  to  produce  neighborhoods  of 

and  V of  Xq,  with  a single-valued  function  x : -*■  V having  the  property  that 

for  each  (A, a)  e N^,  x(A,a)  is  the  unique  solution  in  V of  0 e Ax  + a + Sil^fx)  . 

Note  that  (2.4)  implies  ||x(A,a)  - xQ||  < (A  + e)  ||  (A  - AQ)x0  + (a  - aQ)  ||  for  such  (A, a). 
Choose  neighborhoods  M of  A^,  N of  a^,  and  W of  the  origin  in  X',  such  that 

(1)  for  each  A c M,  X || A - AQ||  < 1 , 

(2)  Mx  (N  - W)  C N , 

and 

(3)  for  each  Ac  M,  a c n and  y c W,  the  point  y + (AQ  - A)x(A,a)  + (a^  - a)  lies 

in  the  neighborhood  U of  the  origin  given  in  the  definition  of  strong  regularity  for  (2.9) . 

Evidently  for  any  Ac  M,  a c N and  y c w,  the  generalized  equation 

y c Ax  f a + 3^(x)  is  uniquely  solvable  in  V (by  x(A,a  - y)  ) . Let  y and  y^ 

belong  to  W,  and  let  and  x^  be  the  solutions  associated  with  y^  and  v9- 
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— 


— 





— 


Then  for  i = 1,2  we  have 

y.  + (A  - A)x.  + (a  - a)  € Ax.  + a + (x . ) . 
jO  lO  O1OC1 

Thus,  by  strong  regularity, 

llxx  - x2ll  i xli>y1+  (A0  _ A,xi  + <a0  ■ 3)1  ■ (y2  + (A0  " a)x2  + <ao  ' a)|H 
i xIIyl-  y2ll  + x II A0 ~ Ali  ll^i  - x2ll  • 

But  as  A || aq  — A ||  < 1,  we  finally  obtain 

Hxl  ‘ x2  II  S X(1  ' x II A0 ~ All)_1  llyi  - y2 II  < 

which  completes  the  proof. 

It  should  be  noted  that  this  result  says,  among  other  things,  that  if  a generalized 

equation  is  strongly  regular  at  a solution  x_ , then  any  sufficiently  "close" 

0 

generalized  equation  will  be  strongly  regular  at  its  solution  near  x^  (which  must 
exist  and  be  unique  by  Theorem  2.1);  further,  the  neighborhoods  involved  in  the  defini- 
tion of  strong  regularity  can  be  taken  to  be  the  same  for  all  nearby  generalized  equations. 
This  is  a property  which  is  not  available  under  the  weaker  hypotheses  used  in  (12). 

3.  Conditions  for  strong  regularity.  We  have  seen  that  nonlinear  generalized 
equations  can  be  expected  to  behave  in  desirable  ways  if  their  linearizations  are  strongly 
regular  at  the  points  in  question.  In  this  section  we  develop  a general  condition 
which  is  sufficient  for  strong  regularity,  as  well  as  a sharper  condition,  designed  for 
the  case  most  frequently  seen  in  applications,  which  we  show  to  be  both  necessary  and 
sufficient . 

To  begin  with,  we  suppose  that  we  are  considering  a generalized  equation  of  the 
form  Ot  Ax  + a + ^^(x)  , where  A is  n x n,  a e B.n,  and  C is  a nonempty 
polyhedral  convex  set  in  Fn.  Let  xQ  be  a solution  of  this  equation,  and  consider 
the  inclusion 

ye  Ax  + a + (x ) , (3.1) 

for  y near  0 and  x near  x^.  We  claim  that  this  can  actually  be  reduced  to  the 
consideration  of 
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z € Bw  + 3ili  (w)  , 

KrxK 


(3.2) 


for  some  non-negative  integers  r,s  with  r + s < n,  a square  matrix  B of  dimension 

(r  + s) , a pointed  polyhedral  convex  cone  K in  RS  of  dimension  s,  ana  points 

r+s 

z,w  near  the  origin  in  R . This  reduction  is  carried  out  in  detail  in  the  appendix 
to  this  paper,  but  here  we  indicate  how  it  may  be  done  in  the  case  found  most  frequently 
in  applications:  namely,  that  in  which  C = R^  x R^.  This  case  includes  the  standard 
linear  complementarity  problem  (1  = 0,  m = n)  and  the  problem  of  quadratic  programming 
without  implicit  constraints,  such  as  non-negativity  (with  i the  number  of  variables 
and  equality  constraints,  and  m the  number  of  inequality  constraints).  If  C = R*  x R^1 , 
we  know  that  for  i = i + 1,  . . . ,9.  + m = n we  must  have  (Ax^  + a)  >0,  with  eouality 
if  <x0).  > 0.  Assume  that  the  variables  and  the  elements  of  Ax^  + a have  been 
reordered,  if  necessary,  so  that 


for  i = i + 1,...,4  + j,  (Axq  + a) ^ =0  and  (xQ)  > 0 ; 

for  i=»£  + j + !,...,£  + j + s,  (Axq  + aK  = 0 and  (XgK  = 0 ; 


for  i 


j + s + l,...,n,  (Ax  + a) . >0  and  (x„) . = 0 . 

0 l Ox 


It  is  quite  clear  that  if  x and  y satisfy  (3.1),  with  x near  xQ  and  y near 

zero,  then  for  i=i+j+s+l,...,n  we  shall  have  (Ax  + a)^  > 0 and  thus  x^  = 0; 

similarly,  for  i = f + l,...,Jt  + j,  x.  >0  and  so  (Ax  + a)  . =0.  Thus  if  we  let 

i i 

r :=  t + j,  partition  elements  v e Rn  as  (v*,v2,v2)  with  v^  e Rr,  v2  e RS, 
and  partition  A conformably  as 


3 , _n-r-s 
v € R 


1 

An 

*12 

*13 

A = 

*21 

*22 

*23 

*31 

*32 

*33 

we  see  that  we  really  only  have  to  consider  solutions  of 

1 . 1 , 2 1 

y - Aux  ♦ A12x  ♦ a 

2 , 1 , 2 2 

y « A21x  ♦ A22x  + a 

2 2 1 2 2 2 

x >0,  (x  ,A21x  ♦ A22x  ♦ a - y > 
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4 


that  is,  of 


*«F 


*2i  *22  1“ 


T + 34>  _ 0 

2 r s 2 

F * F x 


However,  if  we  recall  that 


1 2 

Ax0  + a = A21  XC  + a 


A31X0  + 3 


and  that  = 0,  then  by  writing 


1 1- 

x - xo 

2 2 

x - xQ 


we  can  write  (3.3)  as 


AU  A12 


A21  A22 


W + dip 


TRZ*]RS 

+ 


and  with  z :=  2 this  is  in  the  form  (3.2).  This  reduction,  though  quite  simple, 

y Ki  AJ 

is  very  useful  in  identifying  that  portion  of  the  problem  (viz . , the  matrix  ( ) 

-A21  *22 

to  which  we  have  to  attach  conditions  in  order  for  the  original  generalized  equation  to 
be  strongly  regular  at  x^.  In  quadratic  programming  problems  with  linear  constraints 
but  without  implicit  constraints  on  the  variables,  this  reduction  procedure  amounts  to 
(1)  eliminating  the  constraints  which  are  inactive  at  the  point  in  question,  and  (2) 
regardinq  as  equations  those  (active)  inequality  constraints  whose  associated  multipliers 
are  positive. 

Before  proceeding  to  establish  conditions  ensuring  that  (3.4)  will  behave  well, 

we  comment  on  the  question  of  continuity  for  such  problems.  If  K is  any  polyhedral 
s 

convex  set  in  F , the  operator 


w + 3i \)  (w) 

Fr*K 


11 

A12 

21 

A22 

is  an  example  of  a polyhedral  multifunction  112],  and  T is  also  polyhedral.  We 

have  from  [12,  Prop.  21  that  for  some  fixed  X,  T * is  then  locally  upper  Lipschitzian 

r + s 

with  modulus  A at  each  point  yQ  e R : that  is,  for  each  such  yQ  there  exists 

a neighborhood  V of  yQ  such  that  for  each  ye  V, 

T_1(y)  C T_1(y0)  + A||y-  yQ||B  , 

where  B is  the  unit  ball.  It  is  not  difficult  tc  show  that  if  T 1 is  also  single- 

r+S  —1 

valued  on  a convex  set  D C R , then  T will  necessarily  be  Lipschitzian  there 


with  modulus  X.  Thus,  if  we  can  show  that  such  an  operator  is  single-valued  on  some 
convex  set,  we  can  conclude  immediately  that  it  is  also  Lipschitzian  there. 

In  the  following  theorem,  we  develop  conditions  for  single-valuedness  of  such  an 
operator.  We  use  the  idea  of  a Schur  complement:  if  a matrix  A is  partitioned  as 


A11  *12 


LA21  A22J 

with  A^  and  A^2  square  and  A^  nonsingular,  then  the  Schur  complement  of  A 
in  A,  written  (A/A  ) , is  defined  to  be  A - A A ^A  . An  interesting  treatment 
of  this  idea  may  be  found  in  [1] . 

THEOREM  3.1:  Let  r and  s be  positive  integers,  and  let  K C RS  be  a non- 
empty closed  convex  set.  Let  A be  an  (r+s)  « (r  + s)  matrix: 


I 


An  Ai2 

A = , 

*21  *22 

r+s 

where  _is  r * r.  For  w e R define 

Tw  Aw  + Sib  (w)  . 

Kr*K 

“1  r+s 

For  T to  be  a Lipschitzian  function  defined  on  all  of  R , it  suffices  that: 


(1)  A be  nonsingular 


(2)  (A/A^)  be  positive  definite. 


4 


In  the  special  case  in  which  K = , (2)  may  be  replaced  by 

(2')  (A/A^)  have  positive;  principal  minors, 

and  conditions  (1)  and  (2')  are  then  both  necessary  and  sufficient. 

PROOF:  For  the  first  assertion,  suppose  that  (1)  and  (2)  hold.  Let 


vl  :=  ■ - v[  f Rr.  y\  f ks-  i = 1,2  , 

y-. 


and  consider  the  system 


This  is  equivalent  to 


y1  e Aw1  + 3*  (w1) 

Rr>K 


*1  = AllWi  + A12W2 

y2  6 A21W1  + A22W2  + 3*k(W2)  * 


As  A is  nonsingular,  we  can  see  that 


W1  = AUyi  - AIlA12W2  ' 


so  that  (3.5)  holds  if  and  only  if  (3.6)  holds  and 


Zl  :=  y2  ' A21Aiiyi  £ (A/AU)W2  + 3V(W2) 


The  operator  defined  on  J!  by 


S(w)  :=  (A/A^Jw  + dip^(w) 


is  maximal  monotone;  as  (A/A^)  *s  Positive  definite,  S(w)  is  also  strongly  monotone, 
and  thus  its  inverse  is  Lipschitzian  on  all  of  »S.  Thus,  for  some  fixed  L, 

llw2  ' *2^  :lIIz1'z2II  1 Lll  I_A2iAn  I5II  lly1  - y2 II  . 

This  inequality,  together  with  (3.6),  implies  that  for  a constant  M independent  of 

1 , 2 
y and  y , 

i I w 1 - W2  II  < M [Jy1  - y2 1|  , 
which  proves  the  sufficiency  of  (1)  and  (2). 


Now  assume  that  K = B+  and  replace  (2)  by  (2').  For  sufficiency,  note  that 
(3.7)  is  the  linear  complementarity  problem 

<A/,A11)W2  ~ z*  = 0 

w*  > 0 (3 

< , (A/A^) w2  * zl * = 0 > 

where  < •,■>  denotes  the  standard  inner  product.  It  is  known  [6]  that  (3.8)  has  a 

i s 

unique  solution  for  each  z e B if  and  only  if  (A/A^)  has  positive  principal 
minors.  In  view  of  v 3 . C- ' and  our  earlier  comments  about  polyhedral  multifunctions, 

this  is  enough  to  prove  sufficiency.  For  necessity,  suppose  that  T ^ is  Lipschitzian 

r - s r 

on  all  of  F . Ir  A^  is  singular,  let  u be  a point  of  F not  in  the  range 

s 

of  A^  . bet i * * *  v be  a point  of  F with  all  of  its  components  strictly  negative,  and 


for  any  non-negative  >.  define  y^  :=  |v  j.  We  know  that  yQ  e T(0)  , and  by  the 
assumed  properties  of  T we  know  that  for  all  small  positive  A the  point 


will  be  near  the  origin.  However,  we  have 


V - A21WJ  - A22W2  f 3*  S<W2>  ' 


and  for  small  ) the  left-hand  side  will  be  strictly  negative  in  all  components, 
implying  that  w^  = 0.  But  then 

Xu  = AnwJ  + A12w^  = AuwJ  , 

contradicting  our  choice  of  u.  Thus  A^  must  be  nonsingular,  so  (A/A  j.)  is  well 

—1  s 

defined.  Now  by  our  previous  analysis  T (y)  will  be  a singleton  for  each  y e B 

i s 

if  and  only  if  the  complementarity  problem  (3.8)  is  uniquely  solvable  for  each  z f B . 

We  have  already  remarked  that  this  is  equivalent  to  (2')>  and  this  completes  the  proof. 

We  observe  that  if  K is  a cone  in  Theorem  3.1,  then  the  operator  T is  positively 

homogeneous  ( i.e . , for  X > 0,  T(Xx)  = XT(x)).  In  that  case,  for  T * to  have  a 


4 


property  such  as  unique1  solvability  on  all  of  P 


xt  is  necessary  and  sufficient 


that  that  property  hold  on  any  neighborhood  of  the  origin.  Also,  the  conclusions  of 
Theorem  3.1  are  not  invalidated  if  either  of  the  spaces  pt  and  P^  does  not  appear 
(i.e.,  if  the  set  pr  * K is  P*  itself,  or  is  K) . In  fact,  the  arguments  are 
simplified  in  that  case;  of  course,  the  obvious  changes  in  the  conditions  (1),  (2)  and 
(2')  must  be  made. 

The  following  corollary  relates  the  conditions  just  developed  to  the  property  of 
strong  regularity  which  we  used  in  Section  2. 

COROLLARY  3.2;  Let  C be  a polyhedral  convex  set  in  pr , let  A be  n * n , 
a e pn,  and  let  xQ  solve 

0 e Ax  + a + (x>  . (3.9) 


Let  the  reduced  form  of  (3.9)  at  x^,  if  not  vacuous,  be 


0 e Bw  + (w)  , 

P*K 


B B 
11  12 


B B 
21  22 


(3.10) 


where  K is  a polyhedral  convex  cone  in  P3 . For  (3.9)  to  be  strongly  regular  at  x(J 
it  suffices  that  (i)  the  reduced  form  be  vacuous  or  that  (ii)  B ^ be  nonsingular  and 
(B/B  ) be  positive  definite.  If  K = P^  then  for  (3.8)  to  be  strongly  regular  at 
Xq  it  is  necessary  and  sufficient  that  (i)  the  reduced  form  be  vacuous,  or  (ii)  ^ 
be  nonsingular  and  (B/B^)  have  positive  principal  minors. 

PROOF:  Immediate  from  Theorem  3.1  together  with  Theorem  A. 4 in  the  appendix. 


4.  Applications  and  examples.  Having  developed  the  theoretical  aspects  of  strong 
regularity  in  Sections  2 and  3,  we  apply  these  developments  here  to  problems  in 
complementarity  and  in  nonlinear  programming.  We  first  give  an  example  from  complemen- 
tarity: consider  the  linear  complementarity  problem 

Ax  + a > 0 
x > 0 

( x ,Ax  + a > = 0 , 
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4 


A 


in  which 


i 

1 

l' 

-r 

A = 

4 

5 

1 

f a = 

-4 

7 

1 

1_ 

_ 0_ 

This  is  equivalent  to  the  generalized  equation  0 € Ax  + a + (x)  . The  matrix  A 

R 

is  neither  positive  definite  nor  a P-matrix  (i.e.,  a matrix  with  positive  principal 

• T 

minors),  but  the  problem  has  a solution  with  xQ  = (1,0,0).  Noting  that  the  third 

component  of  AxQ  + a is  positive,  as  is  the  first  component  of  xQ,  we  can  apply  the 

reduction  procedure  to  obtain  the  problem  in  reduced  form  as 


'l 

l" 

\ 

+ ^RxR 

V 

4 

5 

*2 

+ 

*2 

where  z^  ^ conditions  for  strong  regularity  thus  reduce  to  the  require- 

ments that  the  submatrix  (1J  be  nonsingular  and  that  its  Schur  complement  be  a P matrix. 

As  the  Schur  complement  is  [5)  - (4]  (11  ^[1]  = (1J , we  see  that  this  problem  is  strongly 
regular  at  x^.  The  reader  may  wish  to  check  that  if  the  entry  of  5 appearing  in  the 
matrix  is  changed  to  4,  the  problem  is  no  longer  strongly  regular  at  x^,  although 
remains  a solution.  In  fact,  with  that  change  one  finds  that  if  the  second  component 
of  a is  perturbed  to  -4-e  (e  > 0) , the  resulting  problem  cannot  have  any  solution 
near  x0>  although  it  has  the  solution  x^  =(0,1+  c/4,  0) . Even  with  e = 0 the 
solution  set  consists  of  the  line  segment  {(1  - A)(1,0,0)T  + 1(0,1, 0)T|0  < 1 *■  1), 
so  that  xQ  is  not  an  isolated  solution. 

We  next  examine  the  standard  nonlinear  programming  problem 

minimize  0 (x) 

subject  to  g(x)  < 0 (4.1) 


h(x)  = 0 , 


where  0 , 
into  R, 


g and  h are  Frechet  differentiable  functions  from  some  open  set  fl  C Rn 
R^  and  R^  respectively.  The  optimality  conditions  for  (4.1)  are 
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£ • (x,u,v)  = 0 
g(x)  < 0 
h (x ) =0 
u > 0 

< u,g  (x)  > = 0 , 


(4.2) 


P Q 

where  u and  v are  points  in  R and  R respectively,  and  where 


£(x,u,v)  :=  6 (x)  + <u,g(x)>  + <v,h(x)>  and  the  prime  indicates  differentiation  with 
respect  to  the  first  variable  (x) . The  conditions  (4.2)  can  be  written  more  conveniently 
as  the  generalized  equation 


0 e 


•C ' (x,u,v) 

X 

-g  (x) 

+ 

n p q 

u 

-h(x) 

V 

(4.3) 


and  we  shall  consider  the  question  of  proving  strong  regularity  at  a solution  of  (4.3). 


If  the  components  of  such  a solution  are  denoted  by  x^,  u^  and  vQ,  we  can  partition 


the  vector  g(xQ)  into  smaller  vectors  g (x^) , g (xQ)  and  g (x^) , of  dimensions 


r,  s and  t respectively,  and  partitions  uQ  conformably  into  u^,  u^,  and  so  that 


g+ (Xq)  = °-  uo  " ° ' 


g° <xo>  = °'  U0  = 0 ' 


g (xQ)  <0,  uQ  = 0 , 


where  the  ordering  is  componentwise.  The  linearization  of  (4.3)  about  the  solution  we 
are  examining  can,  after  suitable  rearrangement,  be  written  as 


0 e 


£ " 

ht 

g+t 

g°t 

g"t 

x - 

xo  ' 

' 0 

-H 

0 

0 

0 

0 

V - 

vo 

0 

-G+ 

0 

0 

0 

0 

+ 

u 

1 

C 

O + 

+ 

0 

-G° 

0 

0 

0 

0 

0 

u 

0 

uo 

0 

-G~ 

0 

0 

0 

0 

U 

• uo- 

-g’txo)- 

3i|i 


_ n q r s t 

R x R x R xR  xR 
+ + + 


(4.4) 


4 + * 

where  ■£"  denotes  £ " (Xq,Uq, v^) , ^ denotes  h'(xQ),  G denotes  g (x^)  , etc.  One 


can  check  that  the  reduced  form  of  (4.4)  is 
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V 


0 € 


T vr  > 0T1 

•C"  H G , G 

y 

y 

o 

o 

0 

X 

1 

v 

V 

-G+  0 0 | 0 

+ 

w 

+ Sip 

Fn+q+r**S 

■f 

w 

+ 

0 

0 

0 

L-g  o o 1 o J 

L u 

u 

v + + + 

where  w : = u u^;  the  dotted  lines  in  the  matrix  indicate  the  appropriate  partition- 
ing for  the  analysis  of  Section  3.  Using  the  results  of  that  section,  we  see  that 
necessary  and  sufficient  conditions  for  strong  regularity  of  (4.3)  at  the  solution  in 
question  are  that  the  matrix 

JC"  hT  G+T 


-H  0 0 
L-G+  0 0 

be  nonsingular,  and  that  its  Schur  complement 


(4.5) 


(G°  0 0) 

-x 

got 

-H  0 0 

0 

+ 

o 

o 

0 

(4.6) 


be  a P-matrix.  Of  course,  if  g (x^)  is  vacuous  then  the  matrix  in  (4.6)  does  not 
appear,  and  in  that  case  one  may  apply  the  standard  implicit-function  theorem  as  was 
done  in,  e.g.,  {31,  (41,  and  (11).  If  g^(Xg)  is  not  vacuous,  then  the  results  given 
in  this  paper  permit  one  to  carry  out  a similar  analysis  even  though  the  classical 
implicit-function  theorem  does  not  apply. 

In  the  case  in  which  g°(Xg)  is  vacuous,  it  is  well  known  that  certain  standard 
assumptions  on  the  problem  suffice  to  guarantee  nonsingularity  of  the  matrix  in  (4.5) 
above.  In  terms  of  our  notation,  these  are: 

(a)  the  second-order  sufficient  condition  (3,  Ch.  2] : For  each  non-zero  y such 

that 

G+y  = 0 
G^y  < 0 

S 

Hy  = 0 


one  has  ( y,£"y>  > 0. 
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Note  that  G°  appears  in  the  qeneral  form  of  these  conditions;  if  q 'ix^l  wore 
0 

vacuous  then  of  course  G would  not  appear. 

(b)  linear  independence  of  gradients  of  the  binding  constraints:  The  matrix 


has  full  row  rank. 


(c)  strict  complementary  slackness:  g (xQ)  is  vacuous. 

Actually,  only  (a)  and  (b)  are  required  for  nonsingularity , but  (c)  is  needed  to 
make  sure  that  (in  our  terminology)  the  reduced  form  of  the  problem  contains  only  equations. 

We  shall  show  now  that  condition  (b) , together  with  a slightly  strengthened  form 
of  (a),  will  suffice  to  guarantee  that  conditions  (1)  and  (2)  above  are  met,  and  thus 
that  (4.3)  is  strongly  regular  at  the  solution  we  are  considering.  The  strengthened 
form  of  (a)  is: 


(a*)  the  strong  second-order  sufficient  condition:  For  each  nonzero  y with 

G+y  = 0 
Hy  = 0 , 

one  has  < y,£"y>  > 0. 

Of  course,  if  g°(xQ)  is  vacuous  (i.e. , if  (c)  holds)  then  (a)  and  (a')  are  the 
same,  but  in  general  (a')  is  a stronger  requirement  than  is  (a).  With  this  change, 
we  can  prove  that  even  without  strict  complementary  slackness  the  problem  (4.3)  will 
be  strongly  regular. 

THEOREM  4.1:  Let  6,  g and  h be  functions  from  an  open  set  2 C e"  to  * , 

R^  and  Rq  respectively,  which  are  twice  differentiable  at  a point  x^  e : . Suppose 
that  xQ,  together  with  points  uQ  e R^  and  vQ  e R^,  solves  (4.3).  If  the  strong 
second-order  sufficient  condition  holds  at  ^xo,uo,vO^  together  with  linear  independence 
of  the  gradients  of  the  binding  constraints,  then  (4.3)  is  strongly  regular  there. 

PROOF:  To  prove  that  the  matrix  in  (4.5)  is  nonsingular,  suppose  that  a,  b and  e 

are  such  that 
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„ T +T 

jC"a  + H b ♦ G c = 0 


-Ha 

+ 

— G a 


= 0 
= 0 . 


(4.7) 


Premultiplying  the  equations  in  (4.7)  by  aT,  bT  and  cT  respectively  and  adding  the 
results,  we  find  that  (a,£"a>  = 0.  This,  together  with  the  second  and  third  equations 
of  (4.7)  and  the  strong  second-order  sufficient  condition,  implies  that  a = 0;  the 
first  equation  of  (4.7)  and  the  linear  independence  assumption  now  imply  that  b and  c 
are  also  zero.  Thus  the  matrix  is  nonsingular,  so  that  the  Schur  complement  shown  in 
(4.6),  which  we  shall  denote  by  S,  exists.  To  gain  some  additional  information 
about  its  structure,  we  note  that  the  equations 

JC"V  + hta  + g+tb  = got 


-HV 

-G+V 


= 0 
= 0 


(4.8) 


uniquely  define  matrices  V (n  x s) , A (q  x s)  and  B(r  » s) . We  then  have 


S = [G“  0 0) 


= G°V  , 


V 
A 
B 

but  upon  premultiplying  the  first  equation  of  (4.8)  by  V1  we  find  (since  X " is 
T 

symmetric)  that  S - V £"V.  Thus  S is  symmetric,  so  it  will  be  a P-matrix  if  and 
only  if  it  is  positive  definite.  Suppose  that  z e RS  with  <z,Sz>  < 0.  Then  with 
y :=  Vz,  we  have  (y,£"y)  < 0 and  (from  (4.8))  Hy  = 0 and  G+y  = 0.  By  the  strong 

second-order  sufficient  condition  we  must  now  have  y = 0.  Postmultiplying  the  first 

T +t  0 

equation  of  (4.8)  by  z and  using  Vz  = 0 we  have  H Az  + G Bz  = G z,  which  implies 

z = 0 by  the  linear  independence  assumption.  This  completes  the  proof. 

It  is  clear  that  the  conditions  of  Theorem  4.1,  although  sufficient,  are  not  in 

general  necessary  for  strong  regularity  (consider  the  problem  of  minimizing  the 
2 

scalar  function  -£  with  no  constraints  and  let  £=01).  It  may  not  be  so  clear 
whether  we  could  have  used  the  standard  second-order  sufficient  condition  (as  q.ven  in 


-21- 


i 

i 


(a)  above)  in  the  hypothesis  of  Theorem  4.1  instead  of  the  somewhat  stronger  condition 
(a')*  To  see  that  this  cannot  be  done/  consider  the  following  example: 

1,2  2. 

minimize  - (x^  - x2)  - px^ 


subject  to  -x^  + 2x^  < 0 


(4.9) 


-*1  - 2X2  S ° * 

where  p represents  a perturbation  parameter.  We  can  write  the  necessary  optimality 
conditions  for  (4.9)  as  the  linear  generalized  equation 


0 e 


1 0 -1  -1 

*1 

-p 

V 

0-1  2-2 

X2 

0 

X2 

+ 

+ 3d; 

1-200 

u. 

0 

2 2 

u 

1 

F *F 

+ 

1 

12  0 0 

-U2- 

0 

-U2- 

(4.10) 


and  we  see  that  for  p = 0,  (4.10)  is  already  in  reduced  form  with  its  unique  solution 

4 

at  the  origin  in  E . However,  for  each  positive  p there  are  three  solutions,  as 
follows: 


(saddle  point) 

(local  minimum) 

(local  minimum)  . 

By  making  p sufficiently  small,  all  of  these  solutions  can  be  brought  within  any 
preassigned  neighborhood  of  the  origin.  It  follows  that  (4.10)  is  not  strongly  regular 
there;  however,  one  can  verify  easily  that  the  solution  of  (4.9)  for  p = 0 satisfies 
both  the  standard  second-order  sufficient  condition  and  the  linear  independence  condi- 
tion. Those  conditions  would  therefore  not  suffice  to  establish  Theorem  4.1. 

We  have  dealt  with  the  nonlinear  programming  problem  in  the  form  (4.1)  because 
that  form  (with  no  implicit  constraints  on  x)  is  frequently  seen  in  the  literature. 


However,  it  is  perhaps  worth  pointing  out  here  that  for  some  problems  one  may  do  much 
better  by  formulating  them  with  implicitly-constrained  variables.  For  example,  consider 
the  problem  of  minimizing  the  expression  <c,x>  + a||x||  for  x e F^,  where  ||  - 1| ^ 
is  the  maximum  absolute  value  of  the  components  of  x.  If,  for  example,  we  take 
c = [j,  o]  and  a = 1,  one  way  to  formulate  this  problem  using  explicit  linear 
constraints  is: 


minimize  n + — x^ 

Vx2,n 


subject  to  I x 


- eg  < 0 , 


where  I denotes  the  identity  matrix  and  e is  a vector  of  ones.  The  solution  is 

evidently  n = 0,  x = 0,  and  all  four  constraints  are  binding;  thus  one  cannot  use 

Theorem  4.1  to  show  that  the  problem  is  strongly  regular.  In  fact,  this  problem  is  not 

T 

strongly  regular:  if  the  vector  0 is  perturbed  to  (-£,  0,  -£>  0)  for  £ > 0, 
then  the  optimal  solution  set  becomes  {(0,A,e)|X  e [-e,E]  . However,  if  one  formulates 


the  problem  as 


minimize  n + — x 
xrx2,n 


subject  to  x e C :=  xl  h > ||x| 


then  it  can  be  expressed  by  the  generalized  equation 


0 e 0 + 3^c  x 2 . 


and  the  reduction  procedure  alone  suffices  to  show  that  (4.13)  is  strongly  regular  (see 
the  Appendix;  here  the  reduced  form  is  vacuous,  since  in  this  case  the  face  F is 
just  the  origin).  Of  course,  this  problem  can  be  solved  by  inspection,  but  it 


conveniently  illustrates  the  point  that  formulation  can  mak<  a critical  d if f'-reri' ••  in 


I. 

F 


the  stability  properties  of  a problem.  Here  the  formulation  in  (4.11)  makes  even  the 
structure  of  the  function  ||  * ||  subject  to  perturbation,  whereas  in  the  original 
problem  the  data  subject  to  perturbation  miqht  reasonably  have  been  only  ot  and  the 
components  of  c.  This  feature  of  the  problem  is  appropriately  reflected  in  the 


\ 


APPENDIX:  Conversion  of  a Linear  Generalized  Equation  to  Reduced  Form. 

We  shall  deal  here  with  the  linear  generalized  equation 

Of  Ax  + a + 3^  (x)  , (A. 

where  C is  a nonempty  polyhedral  convex  set  in  Rn,  A is  n * n and  a « Kn . If 
xq  is  a solution  of  (A.l) , we  shall  show  that  in  the  qeneral  case  one  can  find  integers 
r and  s,  with  r + s < n,  a pointed  polyhedral  cone  K C RS  of  dimension  s, 
and  an  (r  + s)  x (r  + s)  matrix  B,  such  that  rhere  is  a natural  correspondence 
between  solutions  of 


ye  Ax  + a + B^lx)  (A. 

for  small  y and  for  x near  x^,  and  solutions  of  the  reduced  form 

z e Bw  + 3i(i  (w)  (A. 

Krxx 

k £ 

for  z and  w near  zero.  In  the  case  C = K x R+  this  is  clear,  as  has  already 

been  remarked  in  the  main  part  of  the  paper:  it  corresponds  to  removal  of  the  "inactive 

constraints."  In  other  special  cases,  either  K or  Rr  (or  both)  might  be  vacuous, 

so  that  the  problem  would  become  even  simpler.  For  the  general  case,  we  need  some 

properties  of  polyhedral  convex  sets,  which  we  shall  list  here  without  proof.  We  use 

the  notation  T (x  ) for  the  tangent  cone  to  C at  x„  (i.e.,  for  3i|i„(x„)  ° ) ; for 
v-  v.1  0 CO 

general  information  about  tangent  cones,  polyhedrality , etc.,  see  [13].  In  all  of  the 
following  statements,  C is  a nonempty  polyhedral  convex  set  in  Rn. 

PROPOSITION  A.l:  Let  x0  e C.  Then  there  exists  a neighborhood  U of  the  origin 
such  that  (C  - xQ)  n u = Tctxo)  n u> 

PROPOSITION  A. 2:  Let  yQ  e Rn,  and  let  F :=  (-y^)  . Then  for  each  x e F, 

3^f<x)  = 3^c(x)  + yQR+  =:  {y  + ayQ|y  e Sij/^Cx)  ,a  > 0}.  Further , for  each  y near  y , 
3**(-y)  C F. 

The  next  proposition  provides  the  key  to  establishing  a correspondence  between 
solutions  of  (A. 2)  and  those  of  a problem  equivalent  to  the  reduced  form  (A. 3). 


PROPOSITION  A. 3:  Let  yQ  and  F be  as  in  Proposition  A. 2.  Let  xQ  e F>  and 
write  T :=  Tpfx^).  Let  L be  the  subspace  parallel  to  F.  Then  there  exist  neighbor- 
hoods U and  V of  the  origin,  such  that  for  each  h e U and  each  k e V , 


if  and  only  if 


0 e (yQ  + k)  + 3i|/c(x0  + h) 


0 e P^k  + 3^T (h) 


where  P^  denotes  the  orthogonal  projector  on  L . 

PROOF:  Applying  Proposition  A.l  to  F,  we  can  find  an  open  neighborhood  U of 
the  origin  such  that  (F  - x^)  n u = T n U.  By  Proposition  A. 2,  we  can  find  a neighbor- 
hood VQ  of  the  origin  such  that  for  each  k e VQ,  if  0 e (y^  + k)  + 3^c<x)  then 

x e F.  Also,  there  is  a neighborhood  of  zero  such  that  for  any  x e F, 

(3*c(x)  + yQ®+)  n C 3^c(x)  + yQ10, 11  :=  (y  + “yQ|y  e . o < a < 1}  . 

To  see  this,  note  that  3t|»c(-)  has  only  finitely  many  values  on  F;  for  each  of  these 

values,  say  a polyhedral  convex  cone  P,  let  a be  a simplex  containing  the  origin 

in  its  interior.  The  set  (P  + y^R+)  n o is  a bounded  polyhedral  convex  set,  so  it 

can  be  written  as  the  convex  hull  of  points  q^, ,q^  in  Hn.  For  each  i, 

q.  = p.  + a.y  . with  p.  e p and  a.  > 0.  With  a_  :=  a/max{l,a,  , . . . ,a  } it  follows 

i i i 0 i 1=  0 lm 

that  (P  + yQR+)  n 0^  C p + y^IO.ll.  Repeating  this  procedure  for  each  value  of  3i|'c ( * ) 
on  F and  intersecting  the  resulting  simplices,  we  obtain  the  required  . Finally, 
we  let  V :=  Vq  H ( —v ^ ) . Now  choose  any  h e u and  k e v. 

(only  if)  : Suppose  0 e (y  + k)  + ^^x^  + h)  . As  k e VQ,  we  have  x^  + h e F, 

and  thus  he  (F  - xQ)  n U,  so  actually  h e T.  Let  t e T;  we  shall  prove  that 

< P^k.t  - h>  > 0.  To  begin  with,  note  that  we  may  replace  t - h by 

Aft  - h)  = [(1  - A)h  + > 1 1 - h for  any  small  positive  A;  thus  we  lose  no  generality 

by  assuming  that  t e THU  (recall  U was  open,  so  he  int  U) . Then  x^  * t e F, 


and  of  course  xQ  + h e F , so 


t - h = (x^  + t)  - (xQ  + h)  e L. 
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As  is  orthogonal  to  L by  construction,  we  have  < y^  + P ^k,t  - h>  = 0.  There- 

fore, since  k = P k + P ,k,  we  have 
L L‘ 

< P^k.t  - h > = < yQ  + k,  (xQ  + t)  - (xQ  + h)  > > 0 , 

since  we  assumed  that  0 e (y^  + k)  + 3^^ ^xo  + W • It  follows  that  0 e p^k  + 3i|<T(h)  . 

(if):  Suppose  that  0 e p^k  + 3U'T (h)  . As  h e T <">  U,  we  have  x^  + h e F;  as 

h « int  U we  have  also  that  (h)  = (x^  + h)  = 3 ^(Xq  + h)  + ygR+,  where  we 

have  used  Propositions  A.l  and  A. 2.  Noting  that  l/  C 3i(i  (x_  + h)  , we  see  that  since 

t U 

k » P k + P ,k, 

L L1 

0 e PLk  + 3'f'p(x0  + h)  * k + 3«^F  (xQ  + h)  = k + yQK+  + 3*c(x0  + h)  . 

But  then  -k  e (3^(x^  + h)  + yQF+)  n v , so  for  some  a f (0,11, 

-k  - oyQ  e 3^c  <xQ  + h)  , 

and  since  xQ  + h f F,  we  have 

-y0  * »Vxo  + h)  • 

Multiplying  the  second  inclusion  by  (1  - a)  and  adding  it  to  the  first  we  obtain 

Of  (yQ  + k)  + 3*c(x0  + h)  , 

which  completes  the  proof. 

Using  Proposition  A. 3,  we  can  now  construct  the  reduced  form  of  (A.l)  for  general 

polyhedral  C.  To  do  so,  suppose  that  xQ  solves  (A.l).  Define  y^  :=  AxQ  + a, 

and  let  F,  T and  L be  defined  as  above.  Let  M be  the  lineality  space  of  T and 

choose  orthonormal  bases  b, ,...,b  for  M,  b , , . . . ,b  for  L n , and 

1 r r+1  r+s 

br+s+l' ‘ ‘ ‘ ,bn  f°r  L*  <note:  we  assume  here  for  generality  that  none  of  these  spaces 
is  of  zero  dimension;  if  one  or  more  are  zero-dimensional,  the  analysis  is  only 

simplified) . Assume  that  A and  all  vectors  in  Bn  are  written  in  terms  of  the 

12  3 1 

basis  b, , . . . ,b  ; we  shall  write,  for  example,  x = (x  ,x  ,x  ) with  x e M, 

1 n 

x2r  L O m\  and  x'*  f L*.  We  can  write  T = M + (T  n M'S  , and  the  cone  K :=  T n M^ 
is  pointed  and  contained  in  L n M^;  in  fact,  since  L = aff  T,  K has  the  dimension 
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we  have 


of  L ^ M . Thus,  if  h f 


h =■  <hl,h2,0)  , h1  f Kr,  h 2 f K i 

3*T<h)  = {0}  X 3||/K(h2)  X I<n'r-S  . 

Now  apply  Proposition  A. 3,  with  yQ  :=  AxQ  + a,  to  obtain  neighborhoods  U and  V 
of  the  origin;  construct  additional  neighborhoods  N of  x^  and  w of  0 such  that 
if  x F n and  y F w then 


h :=  x - xQ  F U 


Choose  any  x F N and  any  y E W.  Now  Proposition  A. 3 tells  us  that  (A. 2)  holds  for 
x and  y if  and  only  if 


Writing 


2 2 

y=y  , h = h , a=a 


a) 

= Ah 

- y 

e V . 

A. 3 

tells  us 

that  (A 

(h)  . 

‘ll 

A12 

*13 

21 

*22 

*23 

' 

k31 

*32 

*33- 

we  see  that  (A. 4)  is  equivalent  to 


ii 

*12 

21 

*22- 

3*K(h  ) 


*11  *12 


*21  A2 


we  see  that  the  second  relation  in  (A. 5)  is  of  the  form  shown  in  (A. 3).  We  have  thus 
established  the  required  correspondence. 

This  construction,  together  with  Proposition  A. 3,  now  permits  us  to  state  a 
general  criterion  for  strong  regularity  of  the  generalized  equation  (A.l). 

THEOREM  A. 4:  Let  C be  a polyhedral  convex  set  in  let  A be  n * n and 

a € Suppose  that  € J*n  solves  (A.l).  Then  (A.l)  is  strongly  regular  at  x ( 


4 


if  and  only  if  its  reduced  form  (i)  is  vacuous,  or  (ii)  has  an  inverse  which  is  single- 
valued  on  all  of  L. 

PROOF:  We  assume  the  notation  used  in  the  earlier  part  of  the  appendix. 

(if):  First,  suppose  the  reduced  form  of  (A.l)  is  vacuous.  This  means  that 
L = {0},  so  that  F is  the  singleton  {x^}  and  T = fOl.  Proposition  A. 3 now  implies 
that  for  any  y near  zero,  x^  itself  is  the  unique  solution  of  (A. 2)  near  xQ. 

Thus  (A.l)  is  strongly  regular.  On  the  other  hand,  suppose  that  the  inverse  of  the 
reduced  form  is  single-valued  on  all  of  Pn  (and  therefore  Lipschitzian  there,  as 
we  have  previously  pointed  out) . Consider  the  neighborhoods  W and  N constructed 
above,  and  find  a neighborhood  W„  of  the  origin  with  W„  C w and  having  the  property 

r„n  o rui  0 

satisfy  the  second  relation  in  (A. 5), 


that  if  ye 

W and  if 

v1! 

2 

u 

and  some 

hr 

,2 

hr 

Ly  J 

_h 

then  Xg  + 


e N (here  we  have  used  the  continuity  of  the  reduced  form's  inverse). 


Choose  any  y e WQ;  by  hypothesis  the  second  relation  in  (A. 5)  is  solvable  by  a unique 


, we  have  x e N,  and  therefore  x and  y solve  (A. 2) . 

h. 


h. 

h 

1 

1 

. With  x :=  + 

-h2- 

0 

2 

0 _ 

Further,  the  solution  x is  unique  in  N because  the  solution 


ch2J 


of  (A. 5)  is 


unique.  Thus  the  restriction  to  WQ  of  B n (A(")  + a + 3<ic(-)l-^  is  a single-valued 
function;  the  Lipschitzian  property  now  follows  from  [12,  Prop.  21  and  our  earlier 
remarks.  Therefore  (A.’)  is  strongly  regular  at  xQ. 

(only  if):  Suppose  that  (A.l)  is  strongly  regular  at  xQ,  and  that  its  reduced 
form  is  non-vacuous;  choose  neighborhoods  S of  0 and  Q of  xQ  so  that  for  each 
yes,  UK. 2)  has  a solution  x e Q which  is  unique  in  Q (here  we  are  again  usinq 

fy,' 


continuity).  Let  :=  S W and  Qq  :=  Q <”>  N,  and  choose  any 


1 

l y# 


in  the  projection 


of  Sg  on  L (a  neighborhood  of  zero  in  L) . Let  y e SQ  with  P^y  = 


By 


Lv2J 


construction,  a solution  x e Q exists  for  (A. 2);  define  h :=  x - xn-  the  discus- 

sion preceding  this  theorem,  we  see  that  h ^ = 0 and  that  | *|  and  | *|  satisfy 


. - 

x - x^ . 

By 

N 

0 

V 

and 

■h2- 
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* 


the  inclusion  in  (A. 5) . Let  G := 

the  origin  in  L,  and  it  contains 

fh; 


f Q - xr 


; this  is  a neighborhood  of 


(A. 5)  in  G,  then  xQ  + 


If  there  were  another  solution 


of 


would  belong  to  Q and  would  solve  (A. 2),  contradicting 


strong  regularity.  Therefore  the  reduced  form  of  (A.l)  is  uniquely  solvable  for  each 

ry,"! 

in  the  neighborhood  P^S  ; but  the  reduced  form  is  positively  homogeneous,  so 


it  is  uniquely  solvable  for  each 


1 

Ly2J 


£ L.  This  completes  the  proof. 
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